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Abstract In 1975 James Pickands III showed that the excesses over a high 
threshold are approximatly Generalized Pareto distributed. Since then, a vari¬ 
ety of estimators for the parameters of this cdf have been studied, but always 
assuming the underlying data to be independent. In this paper we consider the 
special case where the underlying data arises from a linear process with reg¬ 
ularly varying (i.e. heavy-tailed) innovations. Using this setup, we then show 
that the likelihood moment estimators introduced by Zhang (2007) are con¬ 
sistent estimators for the parameters of the Generalized Pareto distribution. 
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1 lutroductiou 

A random variable X has a heavy right or left tail if there exists a positive 
parameter 7 , called extremal index, such that as a; —> 00 : 

P(A > x) ^ or P(A < —x) ~ x~^^'^L{x), (1.1) 

where L{x) is a slowly varying function satisfying L{yx) ~ Lix) for any 
y > 0. Here we use the common notation g(x) ~ f{x) as a; —>■ 00 for 
lim,c_>oo g{x)/f{x) = 1. Since heavy-tail analysis is a special branch of extreme 
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value theory, there are basically two main approaches for modeling extreme 
events: 

The first approach - the more traditional one introduced by Fisher & Tippet 
(1928, [B]) - is based on the Generalized Extreme Value distribution (GEVD) 
Gj(x) := exp (—(1 + 1 + yx > 0, and is used to model block ex¬ 

tremes, i.e. maxima (or minima) observed within a pre-defined period. Actu¬ 
ally, as y is assumed to be positive in (HH), the GEVD simply reduces to the 
Frechet distribution in this special setup. 

The second and more recent approach - the one being discussed in this paper 
- is to model exceedances over high thresholds. This kind of modeling was 
introduced by Pickands (1975, [TB]) and is based on the Generalized Pareto 
distribution (GPD) H.y^CT*(x) := 1 — (1 -I- yx/cr*)”^^"’', where 1 -I- yx/cr* > 0 
and tr* = is a suitable scale function. Notice that as before, the GPD 
reduces to the Pareto distribution in case y > 0. 

Glassical studies on how to fit a GPD (see section 2 for references) usually cover 
the simplest case when the underlying data are i.i.d. Though the assumption 
of independence has its practical advantages, most collected data sets have 
non-negligible dependence structures which need to be included in statistical 
modeling. An easy but quite powerful way to model such a dependence is the 
use of so-called autoregressive moving average (ARMA) time series models. In 
detail, a stationary time series {Xn}nGi,o is called an ARMA(p,( 7 )-process if it 
has additive innovations {Z„}„gZp satisfying the following recurrence formula: 


v„ — (piXn-l — ... — (flpXn-p — Zn + Q\Zn-\ BqZ^-q, (1-2) 

with (f)i,... ,(j)p\9i,... ,9q S M and p, g € Nq. From [3], Theorem 3.1.1, it then 
follows that if (j){z) has no root on the unit circle and <j){z) and 9{z) have no 
common zeroes, the ARMA process in (11.21) has the causal representation 




1=0 


where the coefficients {cj}j>o are determined by the relation 


(1.3) 


c(^) = 1^1 ^ 1- 

1=0 

[1] studied the asymptotic behavior of linear processes (also known as MA(oo)- 
processes) given in (11.311 for the case where the innovation’s marginal cdf has 
regularly varying tails in the sense of dLU. Resnick & Starica proved that 
Hill’s estimator is a consistent estimator for y (1995, [16]) and additionally 
showed its asymptotic normality (1997, [17jl. In this paper, the aforementioned 
results shall be extended to the case where one wants to fit a GPD instead 
of simply estimating the extremal index. In section 2 we are going to fit a 
special pair of estimators introduced by Zhang (2007, jUj): the likelihood 
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moment estimators (LMEs) and analogously to [TB], we show in section 3 that 
the LMEs are consistent estimators for the parameters of the Generalized 
Pareto distribution. In section 4 we finally discuss further steps needed to 
show asymptotic normality and why the classical results for tail empirical 
processes by Resnick & Starica are more promising than the widely used and 
powerful tools established by Drees ([S]) when studying asymptotic behavior 
of linear processes. 


2 Fitting a Generalized Pareto distribution for a linear process 
with regularly varying tails 


Let 7 > 0 and {cj}j>o S and consider a linear process 

OO 

Xn = '^^CjZn-j (A..1) 

j=0 

whose iid innovations have a marginal cumulative distribution function 
(cdf) Gz having regularly varying tails with index — 1 / 7 , i.e. 

1 — Gz{z) ^ TTiz~^^^L{z) and Gz{—z) ~ ■n 2 Z ^^^L{z) as 2 : —>■ 00 , 

for 7 ri, 7 r 2 > 0, tti + 7 r 2 = 1 and a slowly varying function L{z) satisfying 
L{yz) ^ L{z) as 2 ; —OO for any y > 0. Clearly, (A. 2 ) implies 1 — G\z\{z) ~ 
z~^/^L{z), where G\z\ is the (marginal) distribution function of \Zn\. 

In order to make some statements about the marginal distribution of we 
need some restrictions on the sequence of coefficients {cj}j>o in (A.l). We 
assume that there is at least one cj ^ 0 and that for some 0 <i 5 <l/ 7 Al: 


00 


j=o 


< OO. 


(A.3) 


Notice that (A.3) always holds for causal ARMA processes as in (11.21) ([i], p. 
341). For simplicity and due to their frequent use, (IA.lll - (IA.3ll will be called 

the Basic Assumptions. 


If we denote the marginal distribution function of |A„| by F|x|, then, with 
the help of Lemma 5.2 of [1], we directly conclude that the Basic Assumptions 
imply 


t—>-oo 1 — G|2'|(^) 




Ic^r/" := 


Under mild restrictions on the coefficients cj the tail behavior of F|x| thus 
coincides with that of G\z\ up to the constant ||c|| and consequently, the 
marginal distribution of the time series |A„| has also a regularly varying tail 
with index — 1 / 7 . This fact has an important consequence: By [5], Theorem 
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1 . 1 . 6 , it directly follows that F|jf| belongs to the domain of attraction of the 
Generalized Extreme Value distribution G-y{x) = exp (—(1 + 72 ;)“^^^) and 
this simply means that there exists a suitable positive function a\x\(t) such 
that for t —>■ c»: 

bix\{tx) - bix\{t) x^-1 

7 ’ ^ ^ 

where 6 |x|(i) := (1/(1-F|x|))^(t) := inf{y ; 1/(1-F|x|(y)) > t} is the 1 - 1 /t- 
quantile of F|x|. On the other hand, following [13] , Theorem 7, there also exists 
a positive function a*{t) such that for F|x|,t(a;) := P (|V| —t<x ||V| > t): 



As anticipated in section 1, the function H.y^CT* {x) = 1 — {1 + "fx/(j*{t)) is 
known as the Generalized Pareto distribution with scale function Rela¬ 

tion (23) thus tells us, that the random variable |A| —t||A| > t, the excess 
above a high threshold t, has approximately a Generalized Pareto distribution. 
In practice, t is replaced by the (fc -I- l)th largest observation \X\n,n-k for a 
sufficiently large k << n. Consequently, the target will be to estimate 7 and 
a*{t) using the k excess-values - \X\„^ri-k, ■ • ■, \X\„^ri-k+i - \X\n,n-k- 

Up to the present day a variety of estimators have been proposed for the 
two parameters 7 and a* = such as the maximum likelihood estima¬ 

tors (EOj), the moment and probability weighted moment estimators (0), the 
likelihood moment estimators ([21]) or the goodness-of-fit estimators ([lO]). A 
nice summary is also available in [5]. 

In this paper only the likelihood moment estimators (LMEs) will be studied. 
The reason behind this choice is that the LMEs can be considered as a gen¬ 
eralization of two aforementioned estimators: If we choose r = — 7 , we’ll get 
the usual maximum likelihood equations. By choosing r = — 1, we just get 
the goodness-of-fit equations of m- As the name suggests, the calculation of 
the LMEs is based on a mixture between the maximum likelihood and the 
moment estimation method. In detail, we seek to solve the following system 
of equations for 7 and cr*: 

k—1 

- ^ log (1 + ^ {\XU,n-i - |X|„,= 7 (2.3) 

^ E (1 + ^ (l^ln.™-. - \X\^,n-k)y^^ = (1 - r)-\ (2.4) 

i=0 ^ 


where r < 1 . 

Defining 7 /cr* := 9* and inserting the first equation into the second one, this 
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task reduces to the one-dimensional problem of solving the following equation 
for 0*: 

i V exp f Uof ^- 0 -(|Vke.-.-|-U„e-e)) \ ,2.5) 

According to [H], Theorem 2.1., equation (12.51) has exactly one solution 9lme 
for r < 1/2 and r ^ 0 which can be easily computed using a Newton-Rhapson 
algorithm. (Notice that a similar procedure is performed when calculating the 
maximum likelihood estimators, but the obtained target function may have 
several roots which are difficult to identify; see 0 )- ILME then is obtained by 
inserting Olme in (USD and from there, it’s a straightforward step to derive 

^LME- 


3 Consistency 


In this section, we show that the likelihood moment estimators (the solutions 
of the system of equations (1231) and (El) are consistent estimators in case 
that our Basic Assumptions hold and provided that r < 0. 

We begin with some preliminaries: Using standard arguments and [ 8 ], Theorem 
1 . 1 . 6 , one can show that the connection between the two “auxilary” functions 
a|x| and a* from ( 12 . 11 ) and ( 12 . 21 ) is as follows (cf. [H]): 

cr(t) := cr*(6|x|(i)) = ~ a|jf|(t). (3.1) 

Combining El and El, we then simply get 

b\x\{tx) - bix\{t) x'^ -1 
cr{t) 7 

which finally yields ( 0 , Lemma 1.2.9) 


^|X|(^) 1 

a{t) 7 ’ 


(3.2) 


both convergences as t —>• 00 . As we will see, the (new) scaling function a{t) 
plays a key role throughout this section. 


Lemma 1 Let the Basie Assumptions hold. Also, let’s denote the space of 
right-continuous functions on [0, 00 ) by D[0, 00 ). Then for any z > 0 we have 
as n, k,n/k —>■ oo.' 

(a) forp.xiz,oo] := {\^^\/^x\in/k) G (z,oo]} andp.{z,oo] := 

p 

yLx{z,oo\ lJ-{z,oo], 


(3.3) 
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(b) for the ([fcs] + \)th largest absolute observation • 

\X\r,.n-fks^ P *„D[0,00), 


(3.4) 


b\x\in/k) 

where [fcs] is the smallest integer greater than or egual to ks with s > 0. 


Proof The proof of (b) is a direct consequence of (a) and can simply be looked 
up in p. 81-83. To prove (13.3L let’s rewrite our sequence Xt as 

OO OO OO 

Xt = CjZt-j = Cj sign.{Zt-j)\Zt-j \ := Cj\Zt-j\. 
j=0 j=o j=o 


Furthermore, let m G N\{0} and define by xj:^'^ := J2'jLoCj\Zt-j\ the asso¬ 
ciated (and truncated) mth order moving-average MA(m). Notice that the c'^s 
are random now, but as we will see below, one can get around this problem 
by conditioning on the first m-|- 1 innovations {sign(Zt__, )}^Q. 

The proof of (13.311 is now set up in two parts: First we are going to show that 

for fj,^^\z,oo] := X)r=i ^ £ ( 2 , oo]|, the random measure 

of the truncated time series, we have 


M, , p , , 

'(z,CO| -> I 11 /, 

2^j=o I'"''' 


(3.5) 


^j=0 I 

as n, fc, n/k —>• cx). By showing (13.51) . the proof of (13.31) is straightforward then. 


Denote the set of all possible (and disjoint) random sequences {sign(Zt_j)}j>o 
by & 00 - A single element of this set is further denoted by sl°°\l > 1. Define 

m 

j=0 


a MA(m) process with positive iid innovations \Zt-j\ consisting of determin¬ 
istic constants c'D that equal either to Cj or —c^, according to Finally 

let 

OO 

A = Xt,i := lim X^f = Vc",|Zt_,|. 

m—>00 ^ 

3=0 

If we denote the distribution function of X and —X by F^^ and F_jjf, respec¬ 
tively, we deduce from |16) . Proposition 3.2, that for any m and a fixed I we 
have as n, k,n/k —>■ 00 : 


1 

k 


El 


y(m) 

b^{n/k) 


G 



P 
—)► 


T.u{i<=hP'Y 

Er=o(KT'")^ 


n{z,oo], 


(3.6) 
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1 

k 


El 


b_xin/k) 


e 



P 


E,"L. 

Er=o {K-V'^) 


fi{z,oo], 


(3.7) 


where bj^{t) and b_^{t) denote the 1 — 1/t-quantiles of and F_j^, respec¬ 
tively, and x'^ := max(x, 0), x~ := — min(a;, 0). 

The goal is now to rescale the empirical processes in dim and (liTfl) in order 
to obtain new processes that depend on b\x\{n/k). Using Lemma 5.2 of [3], it 
follows that for n, k,n/k oo: 


1 — F x{n/k) 
1 - G|z|(n/fc) 


OO 

E 

j=0 




l-F_^(n/fc) 
1 - G|z|(n/fc) 


OO 

V i: (Ky'^’) 


1 - F\x\{n/k) 

1 - G|2|(n/fc) 


OO 


i=o 


(3.8) 

(3.9) 
(3.10) 


Repeating twice the final steps of the proof of Proposition 3.2 of m, » 
(13.101) imply 


b\x\{n/k) 

(Er.o(Ky‘''')")^'' vi(„A) 

(Er=o (Ky'’)")^’ 

bxin/k) 




both convergences as n, k,n/k ^ oo, and from (13.61) and (13.7L we finally get 

^ V"‘ „ ( U",1V7 


1 


El 


y(m) 


b\x\{n/k) 


G (z,oo] 


j:T=o\cp/^ 


■n{z,oo], 


El 


-X 


(m) 


b\x\{nlk) 


G (z,oo] \ -S- 


EE 




Hence we conclude that for any m and for arbitrary I as n,k,n/k —>■ oo: 


/ig(z,oo] :=;^El 


\xlf\ 


b\x\{n/k) 


G {z, oo] 




j=0 \^3 


E oo 

i—' 


T77M(^:,oo]- 


(3.11) 


The convergence in (|3.11l) thus shows that the limit of (-^i oo] as n, k,n/k —>■ 
oo does not depend on I any more because we only need to consider the abso¬ 
lute value of the |cj|s. 
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Next consider the set &m of random sequences {sign{Zt-j)}JLQ with m € 
N\{0}. Gm consists of 2”^+^ elements and we will denote a single element by 
1 < < 2™+^. Clearly, for every there exists at least one element 

in ©oo, say such that and it follows 

from (13.111) that for any 1* and any e > 0: 


P 

= P 


= P 


= P 


, E”lol<=,i''’ , , 

Ey (2. oo) - /‘(s, oo] 

'•'S 1 l>|Y|("/*) ' I 


> £ 


5, 


(m) 


ET=0 




fj.(2 


El 

2=1 


14 "’ 14 °°’ 

bixi(n/k) 


-5 - e (Z, oo] > - |„ n/-. oo] 


Er=olorf/" 


w, , Er=oio.f^" 


fj,(z, oo] 


> £ I —^ 0, 



n^k,n/k oo. By an application of the formula of total probability it then 
follows that for any m G N\{0}: 


, E" a Ic,!'” 


n{z, oo] 


> £ 


2^71+1 

= Ep 




< supP 


M, , Er=olOrr/" , , 

^J■X oo] - ^oo I 11 /.^ oo] 

2^j=o I Oil 

M, , Er=olorr/" , , 

k-x oo] - ^oo I n/.^ oo] 

2^j=o I Oil 


> £ 


> £ 



0 , 


n, k, n/k —i oo. This proves (13.51) and hnishes the first part of the proof. 

To show that the convergence in (|3.3D holds, it now suffices to check whether 

lim limsup P( f/^\z,oo] — fj.x{z,oo] > £ ) = 0. 

n,k,n/k^oo ^ > 

([Is], Theorem 3.5). Based on the proof of [16], Proposition 3.3, define the 
events 


A, := <^ 0 < 


B,; := <^ 0 < 


1^, 


(m). 


l^.l 


\x\{nlk) 5|jc|(n/fc) 
\x\{nlk) 5|x|(n/fc) 


< (5 


< (5 
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a- := 


1^. 


(m)| 




><5 


hx\{n/k) b\x\{n/k) 
for some 0 < ^ < Then for m € N\{0} and e > 0: 


^^^(2:,oo] - ^x(2:,cx)] > e) 


S'’ lE 


Z=1 


1^. 


(m)| 


b\x\{n/k) 


G (z,oo] ^ - 1 




S'- rE 




1^1 


( to ). 


b\x\{n/k) 


G (z,oo] ^ - 1 


h\x\{n/k) 

l^.l 


b\x\{n/k) 


G {z, oo] 

G (2,00] 


> e/3j 
Isi > e/3 




2=1 




2=1 




( to ). 


3n / I 

:= Ii + I 2 + la- 


b\x\in/k) 

l^.l 


b\x\{n/k) 


G (z, 00] j - 1 
G (2, 00] i - 1 




b\x\{n/k) 

f 1X^1 


bixi(n/k) 


G (2:,oo] 

G (z,oo] 


U, > e/ 3 j 
1 b. > e /3 


1^1 


( to ). 


1^11 


^|x|(b/A:) bixi(n/k) 


> S 


Here Chebyshev’s inequality was used in the second last line. Now 


'.sh/E 


l-’f, 


( to ) 


6|x|(n/fc) 


G (2;,oo] ^ - 1 




bixi(Wk) 


G (z + S, 00] 


> e /3 


and since we know from (ESI) that 

r(TO)| 'I 


i Vi|^ 

Z? f ^ I hi -.^1 / 




G (z,oo] \ - 


, n ( |-^( to )| 

IViJ I 

h I hi T-n /Z 




G {z + d, 00] 


p Er=oic,r/" / _ 
Er=oic.r/" 


- (z + 


we conclude Ii —)► 0 as n, fc, n/fc —>■ 00 letting 6 := ( 5 (e) \ 0. 
Similarly 




l-x, 


( to ) 


b\x\in/k) 


G (z — ( 5 , 00] > — 1 


1^1 


( to ). 


^|X|{B/fc) 


G (2,00] 


> e /3 
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and by another application of (E3) we conclude I 2 —1 0 as n, fc, n/fc —1 00 
letting again i5 := S{e) \ 0. Finally, for 5' > 0 and n, k,n/k large: 


II3I 


3 n 
e k 


(||X«|-|Xi| 


> (56|x|(n/fc)) < 




Cj Zi — j 


m+l 


> Sb\x\{n/k) 


3 P(|Sm+iCj^i-j| > 6bix\{n/k)) ^ 3 P (|Em+i > S'biz\in/k)) 

s P {\Zi\> biz\{n/k)) ~e P {\Zi\ > b\z\{n/k)) 


q 00 
m+l 


as n, fc, n/fc —1 00 , where we used the fact that ||a — 6| — |a|| < |&| for a, 6 G R in 
the first line, k/n ^ P(|^i| > h\z\{'n/k)) (Theorem 3.6 of [15]) and h\x\{'n/k) ^ 
const. &| 2 |(n//c) ([5], Lemma 5.2, and [13], Proposition 0.8(vi)) in line 2, and 
again |3], Lemma 5.2, in line 3. Since the last sum converges to zero as m —1 00 , 
the proof of (13.31) is complete. □ 


We next study the behavior of the tail empirical measure of \Xi \ — \X\n,n-k, 
I < i < n. 

Lemma 2 Let the Basic Assumptions hold and let a{t) be defined as in i3.1\) . 
then, as n,k,n/k —1 00 , we have for z > 0; 

-Y,l{\X,\-\X\r,,n-k>o{nlk)z} 4 (27 + 1 )- i / 7 ^ ( 3 . 12 ) 

i=l 


Proof If z = 0, the proof is trivial and hence we lay our focus on the case 
where z > 0. By (13.21) and (13.41) . we have that 

alnjk) \X\n^n-k p 1 
b\x\{n/k) zbix\{n/k) ^ z’ 

n,k,n/k —i oo, and together with (13.3L we get the following joint convergence 
in probability: 




a{n/k) 

bxinjk) 


|-^|n,n—fc \ 
zb\x\{n/k)) 


p 


^^(z,oo],7 



m, Theorem 4.4). 

The final step then is to make use of the operator 


(3.13) 


T : 


—¥ . 


', T{p.x{z,oo],x) fix{zx,oo]. 


Since T is continuous in (ir(z,oo],x) (see [13], proof of Proposition 2.1), a 
simple application of the continuous mapping theorem on (13.131) yields 


T fix{z,oo], 


(j{n/k) \X\n,n-k 


bx{n/k) z^x\{n/k) 
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1 


\Xi\ a{n/k)z + \X\n,n-k 


\^\x\in/k) 


> 


b\x\{n/k) 


= T X] ^ “ l^|n,n-fc > cr{n/k)z} 


T ( ^( 2 , 00 ], 7 + - ) = ^( 27 + 1 , 00 ]. 


□ 


The next result gives the link between the tail empirical measure and a gen¬ 
eralized form of the two tail array sums involved in the likelihood moment 
equations. 

Lemma 3 Assume that the Basic Assumptions hold and let a[t) be defined 
as in Ely. Then, as n, k,n/k —>■ 00 , we have for r < 0 and x,y > O.' 


1 


fc-i 




i=0 


'yx 


P 


(j(rilk) 

dz 




( 1 ^ |n,n—2 fc) 

:= ^i(x), 


(3.14) 


fc-i 




2=0 


7X 


a[n/k) 


(|-^|n,n—2 A:) 


Vy 


p r 


dz 


y Jo (f +1 ){z + ly-^/y 


-h 1 := 'ip2ix,y). 


(3.15) 


Proof We are going to prove (I3.15|) . (13.141) can be shown using the very same 
steps. 

By some straightforward steps and denoting := max(X, 0), it is easy to 
see that 




r/y 




i=l 


'yx 


r{n/k) 


{\x,\- \xy,n-ky 


I ” r7a:(l^i|-|-^ln,n-(c) + /o'(n/fc) 

~ ^ hi y Jo 


'Iv 


dz 


[z + iy-'-Zy 


n — k 


+ 1 


poo -I fJ' r 


dz 


y(n/k)i. 

'yx I {z + iy-’'/y 


-hi. 


Xn,x,y 
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Next define Xri.x.2/ (i) /q I Li=i ^ { cr(n/fe) \^\n,n-k) > ( 2 +i^i-r/H 

for t < oo. Notice that the mapping /(•) f* f{s)/{s + ds is continu¬ 

ous (we are integrating over a finite region) and it follows from (13.121) and the 
continuous mapping theorem that for all t: 


Xn 


At) 


p 


dz 


(f+ 1)'/^ {z + l)^-/v 


The result thus follows again if we are able to show, that for any e > 0; 


lim limsup P{\Xn,x,y - Xn,x,y{t)\ > s) =0. 

n,k,n/k—^oo 


Pick (5 > 0, e' < 1 and 0 < e" < llj — rjy. Then there exists a pair (no, ko) = 
{no{S, e', e”), ko{S, e', e”)) such that for all n > uq and k > ko: 


= P 

< P 


< p 


< e" 


P{\Xn,x,y Xn,x,y{t)\ > e) 

( za{n/k) \X\n,n-k 


't \ixb\x\{n/k) ' b\x\{n/ky \{z + l)'^-^/y 

dz 


, oo 


dz 


> e 


yx{zy-£')/x+{l-5),oo\ 

T 3 ( l^|n,n—fc 


{z + y^-'-Zy 


> £ 


< 1-5 


\b\x\{n/k) 

y,x{y-5-£'){z/x + l),oo] 


dz 


(z + 


> e ) -I- o(l) 


-5-£') 


< const. 


0 ( 1 ) 


dz 

(7+TjT^^ 


o(l) 


dz 


(z-ba::)i/7-e" . (z + 1)1-’'/^ 

const. • -f o(l). 


where we used Chebyshev’s inequality in line 6 and Potter’s inequality ([5]) in 
line 7. Hence for n,k,n/k large: 


lim limsup Pi\Xn,x,y - Xn,x,y{t)\ > £) 
n,k,n/k—^oo 

< lim limsup const. • -|-o(l) = 0. 

n,k,n/k—>-oo 


□ 
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Notice that for the special cases where x = 1 and y = 7 , the integrals in 
(Id.l4|l and (13.1511 are easy to calculate so that 1/^1 (1) = 7 and 1 /^ 2 ( 1 , 7 ) = 
' 02 ( 1 ,'0i(l)) = 1/(1 ~ ^)- This will be essential in our main theorem which 
follows next. 

Theorem 1 Assume that the Basic Assumptions hold and let a(t) be defined 
as in a. Then for r < 0, we have as n, k,n/k —> oo.' 

- (()■ 

Proof We are going to show that 0LMEld{njk) 1 as n,k,n/k ^ 00 , where 
Olme is the (unique) solution of equation (12.511 and 9{n/k) := ^jafnlk). If this 

convergence h olds then 7 = i log(l + ^lme (|X|„_„_i - |X|„_„_fc)) 4 
01 ( 1 ) = 7 by (13.1411 and the continuous mapping theorem and hence we have 

^ p 

that aLME/cr{n/k) = jlme/j • 9{n,/k)/9LME —>■ 1 as n,k,n/k 00 . The 
joint convergence in (13.1611 finally follows by [1], Theorem 4.4. 

Define Yi := Y := {Fq, Fi, ■ • ■, Tfc-i} and for x, y > 0: 

0i(x,F) := {l/k)Y!lZo log(l+x6»(n/fc)F,) and 02(a::, y, V) := {l/k)Y!lZoiZ + 
x9{nlk)YiY/y. 

Now, from Result 3.3 and again Theorem 4.4 of [I], it easily follows as n, k, n/k 
—>■ 00 that 

p 

( 0 i(x,F), 02 (x,y,F)) ^ ( 0 i(x), 02 (x,y)). 

Next consider the operator 

T : R+ X R+ ^ K+ , f (0i(x, F), 02 (a;, y, Y)) ^ 02 (x, 0i(x, Y),Y). 

Since both 0i(x, F) and 02 (x,y, F) are continuous in x,y > 0, T is also 
continuous and hence by the continuous mapping theorem: 

p 

02(x,0i(x,F),F) 02(x,0i(x)), 

which clearly yields 

0(x,F) := 02(x,0i(x,F),F) - ^ 4 02(x,0i(x)) - ^ := 0(x) 

(1-4 (1-4 

(3.17) 

as n, k^n/k —>■ 00 . Notice thereby that 0(x, F) is a strictly decreasing function 
in X G ( 0 , 00 ) (cf. | 21 ) . proof of Theorem 2.1) and according to Lemma 4, its 
limit function 0 (x) has the very same property. 

Recall from (13.211 and (13.411 that 9{n/k)Yo = jYola^njk) ^ Yo/b\x\{n/k) 4 00 
as n,k,n/k —)• 00 . Hence there exists no solution 9lme of (12.511 such that 
9LME/9{n/k) converges to a negative value in probability. 

^ p 

Let’s consequently assume first that 9LME/9{n/k) —>■ a G (0,oo). Obviously, 
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the mapping i—t /(^in/k), Y) is continuous since ipix, Y) 

is continuous in x G (0,oo) and from (I3.17L it follows once more by pQ, 
Theorem 4.4, and the continuous mapping theorem that 

i;{eLME/0{n/k),Y)-i;{a)^0, (3.18) 


n, k, n/k oo. But by definition of 9lme (see (12.51) 1. ip{^LME/9{n/k), y) = 0 
and thus the convergence in (I3.18|) holds if and only if a = 1 because only in 
that case = 0 by the monotonicity of ip- 

Thus, the proof is complete if we are able to show that neither 9LME/9{n/k) — 
0 nor 9LMEl9{nlk) —> oo. 

For the first case, use again the monotonicity of ipi^x.Y) to see that for any 

£ G (0,1): 


lim V ( 0LME/9{n/k) 

n/k—^oo \ 


n,k,n/k—^cio 

lim 

n,k,n/k—^cio 


> e = 


lim P (0LME/0{n/k) > e) 

n/k—^oc \ / 


n,k,n/k—^oc 


lim pUieLME/0in/k),Y) <^i£,Y)) = lim P(0<V'(e,y)) 

n/k—^oo \ / n,k,n/k—^oc 


= 1 , 


because 'ijj{£, Y) —>■ ■)/)(£) > 0. Hence 


P 


9LMEl9{nlk) 


< £ 


0 


as n, k,n/k oo. 

Similarly for the second case: 


lim P 

n,/c,n/fe—foo 


(^dLME/d{n/k) >e 


lim P(0 < V'(e“\y)) =0, 

n,k,n/k—¥oo 


because ip{£ ^,y) ip{£ ^) < 0 by the monotonicity of tl){x,Y). This con¬ 
cludes the proof. □ 

Lemma 4 ^(x) defined in is a strictly decreasing function in x € 

(0, oo). 

Proof Let '0(x, Y) be defined as in (13.171) . Then the aim is to show that 
^'(x, Y) := dip(x,Y)/dx converges in probability to a negative limit for all 
X > 0 as n, fc, n/fc —)• oo. 

Recall that Yi = \X\n^n-i — \X\n,n-k and denote '■= log(l + x 9{n/k)Yi) 
and Zx := i (which in fact is equivalent to ipi{x,Y) in (13.141) 1 for 

i = 0,1,..., fc — 1 and x > 0. Notice thereby that Zi^^ > 0 for all i and x > 0 

and furthermore Zi^^ > if * < j. Also, Z^ —t V^i(x) as n, k,n/k ^ oo by 
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(EH- 

Further recall by (13.21) and (13.411 that for / S {1, 2, 3,4}: 

6»(n/fc)Fpfe.;/5l - Vpfe.;/5l/6|x|(«/fc) -?• I T I - 1, 


and thus, by a simple application of the continuous mapping theorem, 


'^\k-ijh'\,x —^ log + a; 


(3.19) 


as n, k,n/k oo. 

Now, going along the steps of the proof of Theorem 2.1 in [21], 'tp'{x,Y) may 
be expressed in the following way: 

'ip'{x,Y) = Z~‘^k~‘^ ^ 

0<i<j<k-l 

where u{a) := r Z^)jx is strictly increasing and u(a) := (1 — exp(a))/a 

is strictly decreasing in a. Hence Y) < 0 for all a: > 0. If we define the set 
■= {r^/5l < * < r2fc/5], [3^/5] < j < |"4fc/5]}, then, by straightforward 
steps: 

i’'{x,Y) 

< fc~^y^max|Z,,a,Zj-,a;/Z| • {u{Zi^x) - u{Zj^^)) ■ {v{Zi^^) - v{Zj^^))} 

-- 25“^ max {Z^^^Zj^^/Z'^ ■ {ui^Z^^^) - u{Zj^^)) • {v{Zi^^) - v{Zj^^))} 

< 25“^ min min {u{Z,^x) - u{Zj^^)} max {v{Zi^x) - v{Zj^x)] 

ij 

= 25“^ {2'p2fe/5].a;^r4fe/5l.a:/^x} {u{Z ^2k/5'] ,x) “ u{Z ,x)} 

■ {v{Z^ 2 k/b'],x) - v{Z^^k/b],x)] '■= 25“^(li • I 2 • la). 

Using the fact that both u{a),v{a) are continuous in a > 0, (13.141) . (13.1911 and 

P 

several applications of the continuous mapping theorem simply yield li 

P P P 

const. > 0, I 2 ^ const. > 0 and I 3 —>■ const. < 0 so that li • I 2 • I 3 const. < 0 
as n, fc, n/A: —>■ 00 . □ 









16 


Lukas Martig, Jiirg Hiisler 


4 Concluding remarks and outlook 

At first glance it may seem a bit unconventional to use the classical results of 
Resnick & Starica rather than a well-known set of powerful tools established 
by Drees ([3) when studying asymptotic behavior of linear processes. To our 
misfortune, it is a very tough question whether Drees’ main Theorem 2.1. 
can be applied to MA(oo)-processes as the author himself admits that one 
of its conditions “is more complicated to check [...] for general linear time 
series” (p. 635). Nevertheless, he was able to show that all conditions hold 
for an AR(l)-process (section 3.2), a result which was recently extended to 
arbitrary AR(p)-processes, p G N\{0}, by Kulik et al. ([I2)- It thus would 
be of great interest to extend these results once more to the whole class of 
ARMA processes. 

Of course, the next step will be to show asymptotic normality for ^lme and 
olme- a very promising way to do so is to use and to extend the theory 
of tail array sums established by [18], [19], which both served as basis for 
Resnick & Starica’s studies on asymptotic normality of Hill’s estimator for 
autoregressive data m)- Also, a simulation of the finite sample behavior of 
the LMEs would be very useful since the condition n,k,n/k —^ oo is only 
fulfilled if one really collects a very large amount of data - a well-known 
problem when using intermediate order statistics like \X\n^n-k- 



Title Suppressed Due to Excessive Length 


17 


References 

1. Billingsley, P.: Convergence of Probability Measures. Wiley, New York (1968) 

2. Bingham, N., Goldie, C., Teugels, J.: Regular Variation, Encyclopedia of Mathematics 
and its Applications. Cambridge University Press, Cambridge (1987) 

3. Brockwell, P., Davis, R.: Time Series: Theory and Methods, 2nd edition. Springer, New 
York (1991) 

4. Datta, S., McCormick, W.P.: Inference for tail parameters of a linear process with heavy 
tail innovations. Ann. Inst. Statist. Math. 50, 337-359 (1998) 

5. Drees, H.: Extreme quantile estimation for dependent data, with applications to finance. 
Bernoulli 9(1), 617-657 (2003) 

6. Fisher, R., Tippet, L.: Limiting forms of the frequency distribution of the largest or 
smallest member of a sample. Proc. Cambridge Philos. Soc. 24, 180-190 (1928) 

7. Grimshaw, S.D.: Computing maximum likelihood estimates for the generalized Pareto 
distribution. Technometrics 35, 185-191 (1993) 

8. Haan, L. de, Ferreira, A.: Extreme Value Theory. Springer, New York (2006) 

9. Hosking, J.R.M., Wallis, J.R.: Parameter and Quantile Estimation for the Generalized 
Pareto Distribution. Technometrics 29, 339-349 (1987) 

10. Hiisler, J., Li, D., Raschke, M.: Estimation for the Generalized Pareto Distribution 
using Maximum Likelihood and Goodness-of-fit. Comm. Statist. Theory and Methods. 
40, 2500-2510 (2011) 

11. Hiisler, J., Li, D., Raschke, M.: Tail index estimator using maximum likelihood and 
moment estimation. To appear in: Comm. Statist. Theory and Methods (2015) 

12. Kulik, R., Soulier, Ph., Wintenberger, O.: The tail empirical process of 
regularly varying functions of geometrically ergodic Markov chains. Preprint: 
http://arxiv.org/abs/1511.04903 (2015) 

13. Pickands, J.: Statistical Inference using Extreme Order Statistics. Ann. Statist. 3, 119- 
131 (1975) 

14. Resnick, S.: Extreme Values, Regular Variation and Point Processes. Springer, New 
York (1987) 

15. Resnick, S.: Heavy-Tail Phenomena. Probabilistic and Statistical Modeling. Springer, 
New York (2007) 

16. Resnick, S., Starica, C.: Consistency of Hill’s estimator for dependent data. J. Appl. 
Prob. 32, 139-167 (1995) 

17. Resnick, S., Starica, C.: Asymptotic behaviour of Hill’s Estimator for Autoregressive 
Data. Comm. Statist. Stoch. Models 13(4), 703-721 (1997) 

18. Rootzen, H., Leadbetter, M.R., Haan, L. de: Tail and quantile estimation for strongly 
mixing stationary sequences. Technical Report 292, Center for Stochastic Processes, De¬ 
partment of Statistics, University of North Carolina, Chapel Hill, NC 27599-3260 (1990) 

19. Rootzen, H., Leadbetter, M.R., Haan, L. de: On the distribution of tail array sums for 
strongly mixing stationary sequences. Ann. Appl. Probab. 8(3), 868-885 (1998) 

20. Smith, R.L.: Threshold Methods for Sample Extremes. In: Statistical Extremes and 
Applications, J. Tiago de Oliveira (Ed.). D. Reidel, Dordrecht, 621-638 (1984) 

21. Zhang, J.: Likelihood Moment Estimation for the Generalized Pareto Distribution. Aust. 
N.Z. J. Stat. 49, 69-77 (2007) 





